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5 CROSS REFERENCE TO RELATED PATENT APPLICATION 

This application claims priority to United States Provisional Application No. 
60/427,906 (Attorney Docket No. KLA1P099P), filed November 19, 2002, entitled VOID 
CHARACTERIZATION IN METAL INTERCONNECT STRUCTURES USING X-RAY 
EMISSION ANALYSES, which application is incorporated herein by reference it its 
10 entirety for all purposes. 

FIELD OF THE INVENTION 

The present invention relates generally to methods for characterizing voids within 
conductive interconnect structures of semiconductor devices. More specifically, the 
15 invention pertains to using X-ray emission analysis techniques to characterize voids within 
metal vias, contacts and lines of metal interconnect structures. 

BACKGROUND OF THE INVENTION 

Voids within metal interconnect structures, such as those in integrated circuits, can 
have different sizes, distributions and locations within the interconnect structures. For 
20 example, voids can range from "monster" voids occluding whole lines, vias or contacts with 
immediate yield impact, to small voids within lines, vias or contacts that impact the long- 
term reliability of the device. Voids can be uniformly distributed within a line, via or 
contact section or they can be localized in deep or shallow portions of a line, via or contact. 

Void formation can stem from a number of different root causes. For example, a 
25 void can form during a metal film deposition process where there is poor step coverage, 
typically resulting in a slit shaped void in the middle of via or contact regions. A void can 
also form from the scratching of metal surfaces during chemical mechanical polishing 
(CMP) with subsequent metal deposition, typically resulting in a shallow, long void in the 
upper portion of a via and line. Other examples include voids caused during the high 
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temperature annealing process of metal film and during trench and via etching. If one could 
characterize the size, distribution and location of voids, one could gain information about the 
root causes of the voids. 

Traditionally, the only way to know the size, distribution and location of voids is by 
using destructive methods such as Focused Ion Beam (FIB) cross-sectioning techniques. 
Although these techniques can help verify the existence of voids, the sample is destroyed 
during the analysis. This is expensive for integrated circuit manufacturers since it means 
sacrificing product samples from the product line and often results in wasting product found 
to have no significant defects. Furthermore, FIB techniques are time consuming, which can 
cause downtime of the product line production. This can be an especially big problem if 
numerous scans on numerous wafers are needed to ensure void-free processing. 

Accordingly, there is a need to characterize voids that may reside within integrated 
circuit product lines efficiently and without sacrificing valuable product. 
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SUMMARY 



In general terms, the present invention provides methods and apparatus for 
characterizing a potential void or voids by analyzing the X-ray count of one or more emitted 
X-ray species as emitted from an interconnect structure under test in response to a impinging 
beam, such as an electron beam, directed towards the sample surface. For example, this 
analysis may be used to determine whether the structure (e.g., a contact, line or via) has one 
or more void(s). It may also be used to help determine where the void(s) are with respect to 
the interconnect structure. It may also be used to help determine other characteristics of the 
void(s) with respect to the interconnect structure such as the distribution and size(s) of the 
void(s). The analysis may also be used to help initially determine whether the structure 
under test is so out of specification that it cannot then be determined whether the structure 
has a defect of a particular type. 

By directing a beam of appropriate energy toward the sample surface, a certain 
volume of the sample will generate X-rays. These X-rays will radiate from the volume 
within the sample and some of the X-rays will be emitted from the sample surface. These 
emitted X-rays are characteristic of composition of the sample. If the sample is comprised 
of an interconnect structure, such as a semiconductor device, the X-rays emitted from the 
sample will be characteristic of the interconnect structure, as well as the surrounding 
material. 

In one embodiment, the sample is a copper semiconductor device surrounded by 
dielectric material. By detecting and collecting the copper K and copper L X-rays emitted 
from the sample, and by calculating various ratios of the intensities of emitted copper K and 
copper L X-rays, one can characterize the interconnect structure with respect to how deeply 
each of the X-rays penetrate the sample. Thus, by sampling another similar sample, but with 
void(s), one can determine where the void(s) are, how the void(s) are distributed and what 
size the void(s) are with respect to the copper interconnect structure. These techniques are 
described in detail further. One can also detect and collect X-ray emission from other 
elemental species emitted from the sample to implement similar analyses. 

To improve the reliability of the results of these techniques, one can scan a number 
of samples, thereby allowing statistical analyses. These statistical analyses can include 
taking absolute values of the emitted X-rays, calculating the mean of X-ray emission for the 
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various scans and calculating the standard deviations of X-ray emission for the various 
scans. One can also improve the reliability of results by including the X-ray emission data 
of species that are not part of the interconnect structure. For example, if a copper 
interconnect structure is surrounded by silicon dioxide, one can detect and collect the silicon 
Ka emission from the sample and use it to emphasize the variations of the Cu La and Cu Ka 
X-rays with respect to the copper interconnect surroundings. 

In a specific implementation, the characterizing operation is based on collecting the 
characteristics X-rays of a first "non- voided" sample, collecting the characteristic X-rays of 
a second "known voided" sample, collecting the characteristic X-rays of third "unknown" 
sample and comparing the X-ray data from the three samples to determine the void 
characteristics of the third "unknown" sample. This is done by establishing rules for 
characteristic X-ray parameters and "binning" the X-ray data into a database. These 
techniques are described in detail further. 

In one embodiment, the beam directed toward the sample to cause X-ray emission 
from the sample is an electron beam. In one aspect, the electron beam is preferably tightly 
focused, in that it is sufficiently focused to resolve X-ray emission differences between small 
features of interest in current semiconductor devices. In another aspect, the electron beam 
has preferably high enough energy to travel through the different layers of a typical 
semiconductor film stack. 

In one embodiment, the electron beam can be rastered over a region of the sample so 
as to ensure sampling of the entire structure of interest. The rastering can also provide for 
more X-ray data to be collected over the structure of interest. 

In another embodiment, the beam directed toward the sample to cause X-ray 
emission from the sample is a focused ion beam. In one aspect, the focused ion beam is 
sufficiently focused to resolve X-ray emission differences between small features of interest 
in current semiconductor devices. In another aspect, the focused ion beam has preferably 
high enough energy to travel through the different layers of a typical semiconductor film 
stack. 

These and other features of the present invention will be presented in more detail in 

the following specification of the invention and the accompanying figures, which illustrate 

by way of example the principles of the invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



FIG. 1 A is a flowchart illustrating a procedure for characterizing voids in conductive 
interconnect structures of a semiconductor devices in accordance with one embodiment of 
the present invention. 

5 FIG. IB is a flowchart illustrating the operation of FIG. 1A of establishing rules for 

the characterization of voids in accordance with one embodiment of the present invention. 

FIG. 1C outlines a procedure for characterizing voids of a target structure based on a 
set of rules established by analyzing X-ray data taken from a plurality of reference structures 
and known voided structures in accordance with one embodiment of the present invention. 

10 FIG. 2 A illustrates a cross section of copper-cobalt alloy sample with an electron 

beam incident upon a sample showing interaction volume and sampling volume within the 
sample. 

FIG. 2B illustrates a cross section of a semiconductor sample with an electron beam 
incident upon a sample, resulting in X-rays being emitted by various layers of the sample 
15 material. 

FIG. 3A illustrates a cross section of a semiconductor sample with an electron beam 
incident upon an upper portion of a recessed conductive region of the specimen, resulting in 
strong Cu Ka X-ray emission from the sample surface. 

FIG. 3B illustrates a cross section of a semiconductor sample with an electron beam 
20 incident upon an upper portion of a recessed conductive region of the specimen, resulting in 
strong Cu La X-ray emission from the sample surface. 

FIG. 3C illustrates a cross section of a semiconductor sample with an electron beam 
incident upon a lower portion of a recessed conductive region of the specimen, resulting in 
some Cu Ka X-ray emission from the sample surface. 

25 FIG. 3D illustrates a cross section of a semiconductor sample with an electron beam 

incident upon a lower portion of a recessed conductive region of the specimen, resulting in 
very little Cu La X-ray emission from the sample surface. 
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FIG. 4 illustrates one top view and two side views of a semiconductor sample 
containing copper lines and vias and shows some dimensional parameters of the interconnect 
structure. 

FIG. 5 is a diagram of a Bohr model of an atom showing electronic transitions during 
Auger electron emission and X-ray emission competing processes. 

FIG. 6 is an X-ray emission spectrum from a copper semiconductor integrated device 

sample. 

FIG. 7 is a line graph showing the percent of Cu Ka and Cu La X-ray emissions as a 
function of emission depth within a sample. 

FIG. 8 shows cross section views of copper trenches containing voids in different 
locations within the trenches and a corresponding table showing the effects of the different 
void locations on various Cu La, Cu Ka and Si Ka ratios. 

FIG. 9 shows cross section views of different types of voids in complex interconnect 
copper structures and a corresponding table showing the effects of the different void types 
on various Cu La, Cu Ka and Si Ka ratios. 

FIG. 10 illustrates an electron beam induced X-ray microanalysis test system 
according to one embodiment of the present invention. 
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DETAILED DESCRIPTION OF SPECIFIC EMBODIMENTS 
INTRODUCTION 



Reference will now be made in detail to a specific embodiment of the invention. An 
example of this embodiment is illustrated in the accompanying drawings. While the 

5 invention will be described in conjunction with this specific embodiment, it will be 
understood that it is not intended to limit the invention to one embodiment. On the contrary, 
it is intended to cover alternatives, modifications, and equivalents as may be included within 
the spirit and scope of the invention as defined by the appended claims. In the following 
description, numerous specific details are set forth in order to provide a thorough 

10 understanding of the present invention. The present invention may be practiced without 
some or all of these specific details. In other instances, well known process operations have 
not been described in detail in order not to unnecessarily obscure the present invention. 

In general terms, X-ray emission analyses involve characterizing the composition of 
a sample by exciting the atomic core electrons within a sample and analyzing the resulting 

15 emitted X-rays. In the present invention, excitement of the atomic core electrons is achieved 
by bombarding the sample with a focused electron beam (e-beam), although other techniques 
of inducing X-ray emission such as a focused ion beam can be used. Upon sample 
bombardment, a transfer of energy occurs which excite the atomic core electrons into 
different electronic energy levels. Once in this excited state, the atoms have two possible 

20 modes of relaxation: emission of X-rays, or emission of Auger electrons. 

To illustrate these two possibilities, Figure 5 illustrates a Bohr model of an atom with 
three electronic energy levels K, L and M (505, 515 and 517, respectively), with electrons in 
K having greater electronic binding energy than those in L, and electrons in L having greater 
binding energy than those in M. An incident electron 501 strikes an atom with enough 

25 energy to displace an atomic core electron 503, causing the ejection of a secondary electron 
519 and producing a core hole or vacancy 507. With the vacancy in the core energy level, 
the atom is energetically unstable. The most probable stabilization mechanism is filling the 
vacancy with another electron in a higher energy level 509. That is, a second electron falls 
from a higher level into the vacancy with release of energy. The resulting energy may then 

30 be carried off by one of two mechanisms: Auger electron emission or X-ray emission. In 
Auger electron emission, the resulting energy is carried off when an Auger electron 5 1 1 
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from a higher energy level is ejected. In X-ray emission, the resulting energy is carried off 
in the form of emitted X-rays 513, leaving an ionized atom. Auger electron emission and X- 
ray emission are competitive processes. 

The present invention pertains to the detection and analysis of X-ray emissions of a 
5 sample in accordance with above description. Each element has its own characteristic 
electronic energy configurations and its own characteristic X-ray emissions. For example, 
copper has two dominant characteristic X-ray emissions: a La emission emitted when an 
electron falls from the M electronic energy level into the L electronic energy level to fill the 
vacancy resulting from the L shell ionization, and a Ka emission formed when an electron 
10 falls from L electronic energy level to K electronic energy level to fill the vacancy resulting 
from K shell ionization. An energy spectrum of detected X-rays emitted from a sample 
containing different elements shows peaks characteristic of the elements present in the 
sample. The ratios of the intensities of X-ray peaks can provide a quantitative determination 
of surface and subsurface composition. 

15 For example, Figure 6 shows an X-ray emission spectrum from a copper interconnect 

structure in an integrated circuit. The horizontal axis represents energy in keV and the 
vertical axis is the relative intensity. The oxygen Ka (O Ka) peak 601 has an energy of 
about 0.525 keV. The copper La (Cu La) peak 603 has a larger intensity than O Ka and has 
an energy of about 0.93 keV. The silicon (Si Ka) peak 605 has an energy of about 1.74 keV. 

20 The copper Ka peak (Cu Ka) 607 has a lower intensity than the Cu La and Si Ka peaks; it 
has an energy of about 8.04 keV. The small intensity peak to the right of 607 is copper Kp 
peak (Cu Kp), which has an energy of about 8.90 keV. The relative intensities of the peaks 
in this X-ray spectrum can be compared to spectra of the pure elements or other samples of 
known composition to determine the quantitative composition of elements in the sample. 

25 In a preferred embodiment of the invention, X-ray emission is induced with an e- 

beam. In general, when an e-beam bombards the surface of a sample, a portion of the 
sample, the interaction volume, is determined by the kinetic energy of the bombarding 
electrons. While not wishing to be bound by theory, the shape of the interaction volume is 
often described as "teardrop" or "pear" for those skilled in the art. It is worth noting, 

30 however, that the interaction volume for any particular sampling can vary from a vertically 

elongated teardrop, to a hemisphere-like teardrop, to a horizontally squeezed teardrop, 

depending on the composition of the sample and the e-beam energy. For simplicity and 
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clarity, discussions and illustrations herein will describe interaction volumes as "teardrop" 
shaped to help explain certain aspects of the invention. It should be noted, however, that the 
present invention is not limited to any particular shape or size of interaction volume. 

As a result of the e-beam 201 bombardment, regions of sample 203 will 
electronically excite atoms within the sample 203 and generate X-rays characteristic of these 
atomic elements. The regions that generate X-rays are referred to as x-ray excitation 
volumes. In Figure 2A, for example, Cu La excitation volume 207 and Cu Ka excitation 
volume 209 are depicted. Note that Cu La x-ray excitation volume 207 and Cu Ka x-ray 
excitation volume 209 are smaller than interaction volume 205. 

Once X-rays generated in Cu La excitation volume 207 and Cu Ka excitation volume 
209, some of the generated X-rays have enough energy to penetrate through the thickness of 
the sample 203 towards the surface and are emitted from the surface of the sample 203. 
These emitted X-rays (not shown) can be detected by a detector (not shown) and analyzed, 
for example, by X-ray emission spectroscopy techniques as described above. Note that due 
to self-absorption, not all the X-rays generated within sample 203 have enough energy to 
penetrate though the thickness of the sample 203 to reach the surface and thereby be detected 
by a detector. The x-ray emission that escapes from the surface and reaches the detector is 
often from a smaller region than the x-ray excitation volume. This smaller volume is called 
x-ray sampling volume. 

For a comprehensive discussion regarding concepts of electron-material interaction, 
interaction volume and X-ray sampling volume, and x-ray excitation volume can be found, 
refer to the book Scanning Electron Microscopy and X-ray Microanalysis, Second Edition, 
Joseph L Goldstein, et al. Plenum Press, New York and London, 1992. 

As mentioned previously, the present invention is particularly useful for analyzing 
voids within integrated circuits. For an integrated circuit sample containing dielectric 
material such as silicon dioxide and regions of metal conductor such as copper vias or lines, 
For example, Figure 2B illustrates a cross section view of a focused e-beam 201 bombarding 
a sample integrated circuit stack 203 composed of layers of copper 205 and dielectric 
material 207 over a silicon substrate 209. The electron interaction volume is depicted as 
teardrop 211. Some of the X-rays generated within the sampling volume 211 from the 
copper, dielectric and silicon layers in the stack have enough energy to penetrate through the 
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stack 203 so that they are emitted from the surface of the sample. For example, X-rays 213, 
215 and 217 penetrate from different layers of the stack 203 penetrate through the stack and 
are emitted from the surface of the sample. 

In preferred embodiments, the incident e-beam will have a high enough energy to 
5 travel through the different layers of interest of the film stack and to generate X-rays in the 
sample structure of interest. This e-beam energy will depend on the sample composition and 
thickness and on how deep the structures of interest lie. To analyze most structures of 
interest, such as copper vias or contacts and lines of the uppermost stack of a typical copper 
integrated circuit sample, this e-beam energy is preferably no less than about 15 keV. 

10 In preferred embodiments, the diameter of the incident e-beam is tightly focused over 

the structure of interest. Features of interest in integrated circuits in copper vias are very 
small - for example, currently as small as 150 nanometers (nm) in diameter. Therefore, to 
characterize such small features, the e-beam diameter is preferably sufficiently focused to 
resolve them. In some embodiments, the e-beam can be rastered over the region of interest 

15 to collect data. In other embodiments, the e-beam is held over one region of the sample, 
referred to as a "spot mode" method. 

As described previously, copper has two dominant X-ray emissions, higher energy 
Cu K (containing Ka and K(i) X-rays and lower energy Cu L X-rays. In one embodiment of 
the present invention, the dual X-ray line emission of copper is used to characterize different 

20 depths of a copper-containing sample. This is explained by Figure 3A-3D, with each figure 
showing a cross section view of a focused e-beam 301 bombarding the surface of part of an 
integrated circuit sample 303 that has a copper via 305 surrounded by dielectric material 
307. The e-beam trajectory has enough energy to travel through the sample to generate X- 
rays from the excitation volume 309 of the sample. Each of the four figures shows different 

25 scenes from the same e-beam bombardment event of the same sample at the same time. 

Figure 3 A and 3B depict X-rays emitted from the surface of the sample 311. Figure 
3 A shows how Cu Ka X-rays are radiated from the surface of the sample 311. Since the X- 
rays radiate from all directions within the sample, some Cu Ka X-rays 317 get scattered 
within the sample, while other Cu Ka X-rays 3 1 3 radiate from the surface and get detected 
30 by a detector 315. Figure 3B shows how Cu La X-rays are radiated from the surface of the 
sample 311. Cu La X-rays have less energy than Cu Ka X-rays, so the Cu La X-rays are 
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more likely to be scattered, such as Cu La X-rays 32 1, and be absorbed by the sample more 
than Cu Ka X-rays. However, since the X-rays are also radiating from close to the surface 
of the sample 311, some Cu La X-rays 319 are able to radiate unimpeded from the surface 
and get detected by the detector 315. 

5 Figure 3C and 3D depict X-rays emitted from the lower portion of the sample 

volume 323. Figure 3C shows how Cu Ka X-rays are radiated from the lower portion of the 
sample volume 323. As in Figure 3 A, the X-rays radiate from all directions within the 
sample and some of the Cu Ka X-rays 327 get scattered within the sample. Since the X-rays 
are radiating from lower within the sample, more of the Cu Ka X-rays 327 are absorbed by 

10 the sample and get scattered within the sample, thereby letting less Cu Ka X-rays 325 to 
emerge from within the sample to get detected by the detector 315, compared to the Cu Ka 
X-rays from the surface of the sample 311 (depicted in Figure 3 A). Figure 3D shows how 
Cu La X-rays are radiated from the lower portion of the sample volume 323. Cu La X-rays 
have less energy than Cu Ka X-rays, so the Cu La X-rays (e.g., 329) are more likely to be 

15 scattered and be absorbed by the sample than the Cu Ka X-rays. Since the Cu La X-rays are 
radiating deep within the lower portion of the volume 323, there are fewer Cu La X-rays that 
can emerge, such as X-rays 331, compared to Cu La X-rays that can emerge (such as X-rays 
319 of Figure 3B) from the surface of the sample 311. This means that relatively more of 
the Cu La X-rays are absorbed by the sample at deeper emission depths than Cu Ka X-rays. 

20 This difference in absorption of the two types of copper X-rays within copper is 

illustrated in the graph of Figure 7. The figure illustrates the percentage of X-rays emitted 
701 (vertical axis) with respect to emission depth 703 (horizontal axis) in micrometers (|xm) 
within a copper on silicon oxide stack. This percentage of emitted X-rays is normalized to 
the total amount of generated X-ray signals from the sample. Figure 7 shows the emitted X- 

25 ray as detected from an X-ray detector at a take-off angle of 30 degrees with respect to the 
sample surface. The line for Cu La X-ray emission line 705 shows exponential decay with 
increased emission depth, with substantially no emissions detected by 0.50|im sample depth. 
The line for Cu Ka X-ray emission line 707 shows a less dramatic and more linear decay 
than the Cu La X-ray emission. 

30 As stated previously, a typical application of the present invention is the 

analysis/characterization of vias and contacts within metal interconnect structures. In a 
preferred embodiment, the metal interconnect structures are copper lines, vias and contacts 
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within an integrated circuit. A description of how to analyze and characterize different types 
of voids in copper integrated circuits will be discussed in detail further below. 

In alternative embodiments, one can analyze voids in structures composed of other 
metals such as aluminum, tungsten, tantalum, titanium and chromium, as well as non-metals 
such as silicon, silicon dioxide, polyimide, silicides and low k insulation materials. In other 
embodiments, other potentially defective areas such as physical shorts, particles or chemical 
impurities can be analyzed and characterized. In other embodiments, the present techniques 
can be used to analyze the localized film thickness variations of one or more layers of 
material in a film stack. For example, the present techniques can be used to analyze the 
presence of and amount of dishing and erosion phenomenon occurring as a result of 
imperfect CMP processes on metal films. 

CHARACTERIZING VOIDS IN COPPER INTERCONNECTS 

As discussed previously, the emission of Cu K and Cu L from copper can be used to 
characterize different depths of a copper-containing sample. This characterization of copper 
material with respect to depth can be used to detect the presence of voids and to characterize 
the qualities, locations and sizes of the voids. Some techniques for doing this are described 
below. 

Void characterization of Cu interconnect structure can primarily be based on the 
following emissions: low energy Cu La X-rays, high energy Cu Ka X-rays and Si Ka X- 
rays. As described previously, the low energy Cu La X-rays are most strongly detected 
when they are emitted from the sample surface or upper portion of the sample and decrease 
dramatically when emitted from deeper regions of the sample. The high energy Cu Ka X- 
rays are strongly detected when they are emitted from the sample surface and also when they 
are emitted from deeper bulk portions of the sample. 

So if a void were to exist near the sample surface, there is less copper material near 

the sample surface to emit both Cu La X-ray and Cu Ka X-rays. However, the relative 

amount of Cu La X-ray emission will decrease more than the amount of Cu Ka X-ray 

emission. Likewise, if a void were to exist in the deeper portions of the sample, there is less 

copper material in the deeper portions of the sample to emit both types of X-rays. In this 

case, there are already only small amounts of Cu La X-ray emission and the relative 

decrease of the Cu Ka X-ray emission is more dramatic. So detection of Cu La X-rays may 
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be used to characterize the upper portions of the sample while detection of Cu Ka X-rays 
may be used to characterize the deeper portions of the sample. By taking the ratio of 
detected Cu La X-ray emission with respect to Cu Ka X-ray emission of a sample region, 
one can determine whether the void is near the surface of the sample. 

In addition to copper X-rays, Si Ka X-rays will be emitted from the silicon dioxide 
surrounding the copper interconnects of the integrated circuit. Detection of Si Ka X-rays 
may be used to emphasize the variations of the Cu La X-rays and Cu Ka X-rays with respect 
to the surroundings. So one can determine more reliable information if a ratio of detected 
Cu La X-ray and Cu Ka X-ray emissions are taken with respect to Si Ka X-ray emission. 

This ratio data can be used to help determine the presence of a void(s), as well as the 
size, distribution and location of the void(s). This is accomplished by analyzing a 
substantially void-free sample, saving the X-ray emissions data (i.e. different X-ray 
emissions ratios), and comparing it to a sample with a void(s). 

This ratio comparison technique is described further in the table and drawings in 
Figure 8. The figure shows cross section drawings of four different types of copper 
trenches: one with no substantial voids ("no void") 803, and three with a number of voids 
distributed differently within the trench. Trench 805 has a number of voids uniformly 
distributed within the trench ("uniform voids"). Trench 807 has a number of voids 
distributed in deeper regions of the trench ("deep voids"). Trench 809 has a number of voids 
distributed in upper regions of the trench ("surface voids"). Silicon dioxide insulator 
surrounds all the copper trenches. A table 801 below the drawings lists the predicted change 
in X-ray ratios (Cu La/ Si Ka, Cu Ka/ Si Ka and Cu La/ Cu Ka) for each of the four types of 
voided samples. These ratio changes assume full excitation of the trench areas. 

The table 801 can be used to help characterize the voids of the unknown sample as 
being one of the three different types of voided copper trenches. For example, the Cu La/ Si 
Ka and Cu Ka/ Si Ka ratio listings for the "uniform voids" are marked as "«", which 
indicate a predicted or determined large decrease in the Cu La/ Si Ka and Cu Ka/ Si Ka 
ratios compared to the "no void" trench. The Cu Ka/ Cu Ka ratio listing for the "uniform 
voids" is marked as "=", which indicates there is no change predicted in the Cu La/ Cu Ka 
ratio compared to the "no void" trench. Each of the other types of voided trenches in the 
table can be interpreted similarly, thereby enabling one to characterize the voids that exist in 
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a sample in which the X-ray ratios have been measured. Note that for the "no void" trench 
803, all the ratios are marked as "=" since this is the trench that is used as a reference for 
distinguishing the three other types of trenches. 

Figure 9 shows a table 901 listing Cu La/ Si Ka, Cu Ka/ Si Ka and Cu La/ Cu Ka 
ratios for voids within more complex types of interconnect structures compared to the simple 
trench voids in Figure 8. Drawing 903 shows an interconnect structure with regions of 
different types of voids. There are six listed complex voided structures in table 901: 
"missing via" (depicted as 905), "slit void" or "uniform void" (depicted as 907), "partial via 
top void" (depicted as 909), "partial via bottom void" (depicted as 911), "trench void 
bottom" (depicted as 913) and "trench void top" (depicted as 915). Table 901 lists the 
predicted change in X-ray ratios (Cu La/ Si Ka, Cu Ka/ Si Ka and Cu La/ Cu Ka) for each 
of these six types of voided samples and can enable one to characterize the voids that exist in 
a sample in which the X-ray ratios have been measured. 

EXAMPLE ANALYSIS PROCESS 

Now that techniques have described for how to use copper X-ray emission to help 
determine the size, distribution and location of voids in a sample, reference will be made to 
an embodiment of a process flow employing these techniques to analyze a target 
semiconductor device. More specifically, the techniques are applied to analyzing a 
semiconductor device with copper interconnects and silicon dioxide insulator. One process 
flow is summarized in the flowcharts of Figures 1 A through 1C. 

Initially, a "setup" procedure is performed in operation 101. The setup procedure 
includes identifying a structure of interest. The structure of interest is generally defined as 
the region of metal interconnect that is to be analyzed and characterized for voids, such as a 
trench, via, contact pad or part of a line. The structure of interest may be defined in any 
suitable manner to facilitate activation of such structure of interest with a charged particle 
beam, such as an e-beam. In one example, the structure of interest is defined with a set of 
parameters. For example, if the structure of interest is a via, parameters may be copper 
thickness, via height, and via diameter. See Figure 4. In this figure, a top view 401 of a 
copper line 403 is shown and a side view 405 of the same copper line showing vias 407 
connecting the upper copper line 403 to copper lines 409 of the next lower metalization 
layer. In this example, the structure of interest is the area above and around each of the 
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copper vias 407. The parameters for this example may include the top copper line width 
41 1, the copper line thickness 413, the via height 415 and the via diameter 417. 

View 419 shows the same perspective of the structure as 405, but showing repeated 
e-beam sampling directed at these structures of interest at different sampling time intervals 
(421, 423 and 425). The resulting teardrop shaped electron interaction volumes (427, 429 
and 431, respectively) are shown. These electron interaction volumes are the regions that 
will emit X-rays. 

The setup procedure (Figure 1A, block 101) also includes establishing a set of 
irradiation parameters, i.e. the scanning parameters if the e-beam is scanned over the sample 
or e-beam position if the tool is in spot mode. That is, a predefined set of operating 
conditions of the tool (e.g., e-beam tool) is established for irradiating the structure of 
interest. In one implementation, the irradiation parameters include an e-beam irradiation 
area or pattern, e-beam energy, e-beam current, and irradiation duration. The e-beam 
irradiation condition should be optimized according to the structure of interest and the 
feature information to be extracted. That is, different scan parameters can be used for the 
same structure, depending what information is to be obtained. For example, lower incident 
beam energy may be used if a void is near the surface and a higher beam energy may be used 
if a void is buried deep is within the structure of interest.. 

Referring back to the flowchart Figure 1A, the next process block 103 includes 
irradiating reference structures with electron beam and storing the reference structure X-ray 
data (reference feature vector, <F> re f) into a database. A reference structure is a "good" 
structure of interest in that, ideally, it has substantially no voids. It is the structure to which 
other sample structures of interest are compared. The <F> re f may have any suitable format 
for facilitating characterization of the X-ray spectral data obtained from the multiple 
reference structures. In one embodiment, the <F> re f for a plurality of irradiated reference 
copper structures includes the following parameters: the mean of the absolute X-ray counts 
for Cu Ka, the mean of the absolute X-ray counts of Cu La, the mean of the absolute X-ray 
counts of Cu total (includes Cu Ka, CuKfJ and CuLa), the mean of the absolute X-ray counts 
of Si Ka, the mean of the ratios of Cu La/Cu Ka, the mean of the ratios of CuTotal/Si Ka, 
the mean of the ratios of Cu La/Si Ka, and the standard deviations of the proceeding 
absolute and ratio X-ray counts. The <F> re f values will vary for each type of structure of 
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interest. For example, the <F> re f for a copper via will differ from the <F> re f of a copper 
trench. 

One method of obtaining the <F> re f is by irradiating a sample using e-beam 
techniques described previously. The electron irradiation may be done by scanning an e- 

5 beam (typically rastered) or irradiating a single spot (spot mode) over the area of the 
reference structure, detecting the X-ray emission data and storing it into an analysis or 
processor unit. To ensure that a reliable <F> re f is calculated, it is preferable that a sufficient 
amount of multiple measurements of the same reference structures are taken so that X-ray 
emission data can be averaged and standard deviations can be calculated. Preferably, 

10 the<F> re f is calculated from a mean of number of measurements at different locations but 
with the same structures sufficient. Each measurement may contain multiple raster scans 
over a certain acquisition time. Furthermore, the ratios of the X-ray emission data are 
preferred to the absolute intensity for each elemental species. This is because ratios are 
generally more reliable than the absolute X-ray emission intensity. While the absolute 

15 intensity of each element is subject to current and detector efficiency fluctuation, X-ray 
intensity ratios of two elements is not. Additionally, the scan parameters (e.g., landing 
energy, scan area, beam current density, etc) established in operation 101 are set for each 
reference scan to minimize differences due to variable operating conditions. 

Any suitable electron beam induced X-ray microanalysis system may be utilized to 
20 practice and/or implement the techniques of the present invention. An eV300 automated e- 
Beam wafer review system available from KLA-Tencor Corporation of San Jose, California 
may be used. Figure 10 is a diagrammatic representation of a system utilizing an electron 
beam induced X-ray microanalysis test system according to one embodiment of the present 
invention. The system represented in Figure 10 includes a beam generator 1000, which 
25 directs an e-beam 1001 at the sample 1003. The sample 1003 in the example depicted in 
Figure 10 is a semiconductor wafer having a silicon substrate 1009 upon which a silicon 
dioxide layer 1011 is patterned with a plurality of trenches 1013 filled with copper. Before 
directing e-beam 1001 to sample 1003 and collecting X-ray data, the locations of the 
structures of interest are established by using standard imaging techniques. In preferred 
30 embodiments, the beam spot size is small enough during imaging for resolving the structures 
of interest. The structures of interests can be irradiated by using a raster scan or by 
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switching to spot mode to acquire x-ray data. In spot mode, the beam spot size may be 
different from the spot size of raster scan. Furthermore, the beam spot size can be varied. 

At least one X-ray detector is used to collect the X-rays emitted from the surface of the 
sample. The system in Figure 10 includes four X-ray detectors 1005 positioned above the 

5 sample. Any suitable number and type of detector for measuring X-rays at specific energy 
levels may be utilized. One type of detector is an Energy Dispersive X-ray spectrometer 
(EDX) system, which collects photons in a wide spectrum of energies. EDX systems are 
capable of collecting a greater range of signals. As a result however, EDX detectors also 
collect photons having energies surrounding the characteristic photon energies. This causes 

10 EDX detectors to have lower signal to noise ratios. Another type of detector is a wavelength 
dispersive system (WDS) X-ray detector. Several suitable embodiments of WDS X-ray 
detectors are described further in co-pending U.S. Patent Application No. 09/695,726, filed 
23 October 2000, which application is incorporated herein by reference in its entirety. 

In the system depicted in Figure 10, each of the X-ray detectors is coupled with an 
15 analysis or processor unit 1007. The analysis/processor unit 1007 can be configured to 
analyze the data collected by the X-ray detectors 1005 to generate X-ray ratio data of the 
elemental species in the sample, such as the Cu La/ Si Ka, Cu Ka/ Si Ka and Cu La/ Cu Ka 
described previously. The analysis/processor unit 1007 may take the form of any suitable 
processing or computing system, such as a workstation. 

20 Another method of obtaining the reference vector (<F> re f) is by generating simulated 

data, which does not require actual scanning of samples. Simulated data can be calculated 
by statistical methods to generate a predicted <F> re f. Modifying data from previously 
scanned "real" data from similar reference structures can also be used to approximate a 
predicted <F> re f. The <F> re f can also be obtained by using a combination of simulated and 

25 "real" data. 

Referring back to the flowchart Figure 1A, the next process block 105 includes 
scanning structures known to contain void(s) and storing this known voided structure X-ray 
data into the database. These known voided structures have the same parameters as the 
reference structure. The X-ray data from these scans may then be used to define a defective 
30 feature vector (<F> de f). The <F> de f values will vary for each type of void(s) present in the 
structure. As with the reference structure data, it is preferable that a sufficient amount of 
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scans of voided structures are taken so that X-ray emission data can be averaged and 
standard deviations can be calculated. The <F>a e f preferably includes the same parameters 
as the <F> re f, for example, as described above with respect to operation 103. The<F> re f is 
preferably calculated from measurements with good signal-to-noise ratio. Furthermore, to 

5 take out variations in differences of the e-beam condition such as beam energy and beam 
current that occur at each measurement, it is preferable to use ratios of the X-ray emission 
data for each elemental species. Additionally, the same scan parameters (e.g., landing 
energy, scan area, beam current density, etc) established in step 101 are set for each known- 
voided scan, as well as each reference scan. Like <F> re f, the <F>d e f data is stored in the 

10 analysis or processor unit described previously. 

The next process block 107 in the flowchart of Figure 1A includes establishing rules 
for the characterization of voids based on the stored data obtained from the irradiated 
reference and known- voided structures (e.g., <F> re f and <F> det ). Figure IB is a flowchart 
illustrating the details of block 107, Figure 1A in accordance with one embodiment of the 

15 present invention. Initially, the data that was stored from the reference and known voided 
structures (e.g., <F> re f and <F> de t) are analyzed in operation 141. The following operations 
143, 145, and 147 may be performed in any suitable order. In the illustrated embodiment, in 
block 143 rules for a "Process Variation Test" or PVT rules are established based on this 
data analysis. PVT rules are established for later determining whether an unknown structure 

20 has X-ray data that is so out of specification that the unknown structure cannot be 
characterized. In other words, an unknown structure may deviate so far from the design data 
that the X-ray data from such structure cannot be further analyzed to determine whether the 
structure has a void. 

In general terms, the X-ray data from the reference and known-void data is analyzed 
25 to determine an acceptable range for X-ray data obtained for an unknown voided structure. 
For instance, one method for determining process variation involves evaluating Figures of 
Merit that describe how much variation exists in collected data sets. These Figures of Merit 
are often referred to as Process Variation Test Figures of Merit, or PVT Figures of Merit. 
One method of defining PVT Figures of Merit is to calculate the coefficient of variation 
30 using the data sets obtained for the same structure but over multiple locations on the wafer. 
Typical values of Figures of Merit for normal process variation within process control can be 
determined experimentally for each layer of the sample. The typical values can be set as 
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default thresholds to distinguish between normal process variation and out-of-control 
process variations. 

After the PVT rules are established, rules for determining the presence or 
"localization" of a void within an unknown structure may then be established based on the 
analysis of the data from the reference and/or known voided structures in operation 145. 
Such rules are referred to herein as "Defect Localization Test" or DLT rules. The DLT rules 
may be established in any suitable manner. In some embodiments, DLT rules are 
determined using a combination of experimental data, theoretical analyses, and 
computational modeling. For instance, it may be determined that if the X-ray absolute count 
or ratio data from an unknown voided structure is within a predefined range, a void is 
present within such structure. The range may be determined experimentally. In one 
implementation, a reasonable initial rule (e.g., as listed below) may be determined and then 
verified by inspecting a plurality of known voided structures to determine whether this rule 
reliably classifies each known voided structure as having a void. An example set of DLT 
rules may include: 

if any or a specified subset of the mean X-ray count or ratio value from the unknown 
structure is more than two standard deviations (e.g., as obtained from the feature vector of 
the reference structures) from the corresponding X-ray count or ratio value from the 
reference structure, the unknown structure is characterized as having a void (Note that the 
rule can be tightened by using three or higher standard deviations to provide more 
sensitivity); and 

if any or a specified subset of the mean X-ray count or ratio value from the unknown 
structure is not more than two standard deviations from the corresponding X-ray count or 
ratio value from the reference structure, the unknown structure is characterized as not having 
a void. 

After the DLT rules are established for the structure of interest, rules for binning or 
classifying the localized void(s) are established based on the analysis of the X-ray data from 
the reference and/or known voided structures in operation 147. These rules are referred 
herein as "Defect Binning Test" or DBT rules. These DBT rules may be determined by 
analyzing how much particular subset(s) of the X-ray data obtained from the known void 
structures differ from corresponding data from the reference structures. The tables of 

KLA1P099/P1085, P1086 20 



Figures 8 and 9 represent reference and known-voided data obtained from trench type 
structures and from complex type structures, respectively. Note that these tables are shown 
as exemplary methods for classifying voids using an example set of DLT rules and are not 
meant to limit the invention to any particular method or DLT rules. In Figures 8 and 9, the 
5 following symbols are defined as follows: 



Symbol 


Description 


Definition 


« 


"much less than" 


Unknown structure ratio is more than seven standard 
deviations less than reference structures ratio. 


< 


"less than" 


Unknown structure ratio is more than four but less 
than seven standard deviations less than reference 
structures ratio. 


< 


"somewhat less than" 


Unknown structure ratio is more than two but less 
than four standard deviations less than reference 
structures ratio. 




"equal to" 


Unknown structure ratio is less than two standard 
deviations greater or less than reference structures 
ratio. 


> 


"somewhat greater than" 


Unknown structure ratio is more than two but less 
than four standard deviations greater than reference 
structures ratio. 


> 


"greater than" 


Unknown structure ratio is more than four but less 
than seven standard deviations greater than reference 
structures ratio. 


» 


"much greater than" 


Unknown structure ratio is more than seven standard 
deviations greater than reference structures ratio. 



Therefore, by using the rules indicated in the tables of Figures 8 and 9, one could 
characterize voids within different types of samples. For example, referring to the table 801 
10 of Figure 8, the following example rules may be established for characterizing unknown 
voided trench type structures: 

if the ratios of CuLa/SiKot obtained from an unknown structure is "much less than" 
(more than seven standard deviations less than) the corresponding ratio of CuLot/SiKa 
obtained from the reference structure AND if the ratio of CuKcc/SiKcc obtained from an 
15 unknown structure is "much less than" the corresponding ratio of CuKot/SiKa obtained from 
the reference structure AND if the ratio of CuLa/CuKoc obtained from an unknown structure 
is "equal to" (within two standard deviations) the corresponding ratio of CuLa/CuKoc 
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obtained from the reference structure, the unknown trench structure contains uniform voids 
805; 

if the ratio of CuLa/SiKa obtained from an unknown structure is "equal to" the 
corresponding ratio of CuLa/SiKa obtained from the reference structure AND if the ratio of 
5 CuKa/SiKa obtained from an unknown structure is "much less than" the corresponding 
ratio of CuKa/SiKa obtained from the reference structure AND if the ratio of CuLa/CuKoc 
obtained from an unknown structure is "much greater than" (more than seven standard 
deviations greater than) the corresponding ratio of CuLa/CuKa obtained from the reference 
structure, the unknown trench structure contains deep voids 807; and 

10 if the ratio of CuLa/SiKa obtained from an unknown structure is "much less 

than"the corresponding ratio of CuLa/SiKa obtained from the reference structure AND if 
the ratio of CuKa/SiKa obtained from an unknown structure is "somewhat less than" (more 
than two but less than four standard deviations less than) the corresponding ratio of 
CuKa/SiKa obtained from the reference structure AND if the ratio of CuLa/CuKa obtained 

15 from an unknown structure is "less than" (more than four but less than seven standard 
deviations less than ) the corresponding ratio of CuLa/CuKa obtained from the reference 
structure, the unknown trench structure contains uniform voids 809. 

Note that these rules may vary depending on the structure that is being inspected. Of 
course, any suitable number and type of rules may be generated from the table 801 of Figure 
20 8. DBT rules may similarly be generated for complex structures. For example, referring to 
the table 901 of Figure 9, the following example rules may be established for characterizing 
unknown voided complex type structures: 

if the ratio of CuLa/SiKa obtained from an unknown structure is "much less than" 
than the corresponding ratio of CuLa/SiKa obtained from the reference structure AND if the 
25 ratio of CuKa/SiKa obtained from an unknown structure is "much less than" the 
corresponding ratio of CuKa/SiKa obtained from the reference structure AND if the ratio of 
CuLa/CuKa obtained from an unknown structure is "somewhat less than" the corresponding 
ratio of CuLa/CuKa obtained from the reference structure, the unknown trench structure 
contains a missing via type void 905; 
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if the ratio of CuLa/SiKa obtained from an unknown structure is "much less than" 
the corresponding ratio of CuLa/SiKa obtained from the reference structure AND if the 
ratio of CuKa/SiKa obtained from an unknown structure is "less than" the corresponding 
ratio of CuKa/SiKa obtained from the reference structure AND if the ratio of CuLoc/CuKoc 
5 obtained from an unknown structure is "less than" the corresponding ratio of CuLoc/CuKa 
obtained from the reference structure, the unknown trench structure contains a top portion 
partial via type void 909; and 

if the ratio of CuLa/SiKa obtained from an unknown structure is "much less than" 
the corresponding ratio of CuLa/SiKa obtained from the reference structure AND if the 
10 ratio of CuKa/SiKa obtained from an unknown structure is "less than" the corresponding 
ratio of CuK/SiK obtained from the reference structure AND if the ratio of CuLa/CuKa 
obtained from an unknown structure is "much less than" the corresponding ratio of 
CuLa/CuKa obtained from the reference structure, the unknown trench structure contains a 
top portion trench type void 915. 

15 Referring back to Figure 1A, the next process block 109 includes scanning a target 

(unknown) structure and characterizing the voids in the target structure based on the 
established rules. Figure 1C outlines a procedure for characterizing a target structure based 
on the established rules in accordance with one embodiment of the present invention. First 
in operation 161, a target structure is scanned to obtain X-ray data corresponding to at least a 

20 portion of the data collected for the reference and known voided structures. For example, 
absolute X-ray counts and ratios are obtained for CuLa, CuKa, and SiKa. Of course, the 
type of X-ray data obtained will vary with the composition of the target structure. 

It is then determined whether the quality of the data obtained from the target 
structure is acceptable based on the established PVT rules in operation 163. In some specific 

25 examples, the dividing line separating acceptable from unacceptable data are set to between 
about 0.03 to 0.05, depending on the feature (Figures of Merit) being inspected. If it is 
determined and reported that the data is unacceptable and the process variation is too high, 
the void characterization process 109 is halted in operation 164. In other words, the X-ray 
data from the target structure is too out of specification to characterize further (e.g., 

30 determine whether a void is present and classify such void if found). The data and/or target 
structure may then be further inspected to determine why the data is so out of range. 
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If it is unknown whether the scanned area contains voids, a Suspect Outlier Test can 
be conducted to find the outlier among the data set for the purpose of accurate defect 
localization. This test can be done by using any suitable standard outlier test method or 
statistics. Once an outlier is identified, the data associated with this outlier will not be used 
in the calculation of PVT Figures of Merit to evaluate the data scattering and process 
variation. If the purpose is to evaluate the data scattering and process variation without need 
to identify the exact void presence, then PVT Figures of Merit can be calculated over all the 
data. 

If the data is acceptable, it is then determined whether a void is present based on the 
established DLT rules in operation 165. The localization of void is based on the Figures of 
Merit calculated for DLT and the rules of DLT. In one embodiment, using the above 
described sample DLT rules, it may be determined that there is no void present when any or 
a specified subset of the mean X-ray count or ratio value from the unknown structure is not 
more than 2 standard deviations from the corresponding X-ray count or ratio value from the 
reference structure, the target structure is characterized as not having a void. Otherwise, the 
target structure is characterized as having a void. If a void is not present, it may be reported 
that a void is not present and the void characterization process 109 is stopped in operation 
166. 

If there is a void, the found or localized void is then classified based on the DBT 
rules in operation 167. For example, trench void(s) are classified as one or more uniform 
void(s), one or more deep void(s), or one or more surface void(s). In another application, 
complex type structure void(s) are classified as one or more missing via(s), one or more slit 
or uniform void(s), or one or more partial/top void(s), one or more partial/bottom void(s), 
one or more trench/top void(s), or one or more trench/bottom void(s). In a specific 
application using the above example DBT rules, a target trench type structure is 
characterized as having uniform voids when the ratio of CuaL/SiKoc obtained from the 
target structure is much less (e.g., less than 7 standard deviations) than the ratio of 
CuLa/SiKa obtained from the reference structure AND when the ratio of CuKa/SiKa 
obtained from the target structure is much less than the ratio of CuKa/SiKa obtained from 
the reference structure. After the target structure void(s) are characterized (e.g., qualified as 
being acceptable to characterize further, located as being present, and classified into a 
specific void type if present), the characterization process ends. 
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Although the foregoing invention has been described in some detail for purposes of 
clarity of understanding, it will be apparent that certain changes and modifications may be 
practiced within the scope of the appended claims. Therefore, the described embodiments 
should be taken as illustrative and not restrictive, and the invention should not be limited to 
5 the details given herein but should be defined by the following claims and their full scope of 
equivalents. 
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